Type-based MCMC for Sampling Tree Fragments from Forests
نویسندگان
چکیده
This paper applies type-based Markov Chain Monte Carlo (MCMC) algorithms to the problem of learning Synchronous Context-Free Grammar (SCFG) rules from a forest that represents all possible rules consistent with a fixed word alignment. While type-based MCMC has been shown to be effective in a number of NLP applications, our setting, where the tree structure of the sentence is itself a hidden variable, presents a number of challenges to type-based inference. We describe methods for defining variable types and efficiently indexing variables in order to overcome these challenges. These methods lead to improvements in both log likelihood and BLEU score in our experiments.
منابع مشابه
Sampling Tree Fragments from Forests
We study the problem of sampling trees from forests, in the setting where probabilities for each tree may be a function of arbitrarily large tree fragments. This setting extends recent work for sampling to learn Tree Substitution Grammars to the case where the tree structure (TSG derived tree) is not fixed. We develop a Markov chain Monte Carlo algorithm which corrects for the bias introduced b...
متن کاملForest Stand Types Classification Using Tree-Based Algorithms and SPOT-HRG Data
Forest types mapping, is one of the most necessary elements in the forest management and silviculture treatments. Traditional methods such as field surveys are almost time-consuming and cost-intensive. Improvements in remote sensing data sources and classification –estimation methods are preparing new opportunities for obtaining more accurate forest biophysical attributes maps. This research co...
متن کاملComparison of Machine Learning Algorithms for Broad Leaf Species Classification Using UAV-RGB Images
Abstract: Knowing the tree species combination of forests provides valuable information for studying the forest’s economic value, fire risk assessment, biodiversity monitoring, and wildlife habitat improvement. Fieldwork is often time-consuming and labor-required, free satellite data are available in coarse resolution and the use of manned aircraft is relatively costly. Recently, unmanned aeria...
متن کاملGuidelines for Sampling Aboveground Biomass and Carbon in Mature Central Hardwood Forests
—As impacts of climate change expand, determining accurate measures of forest biomass and associated carbon storage in forests is critical. We present sampling guidance for 12 combinations of percent error, plot size, and alpha levels by disturbance regime to help determine the optimal size of plots to estimate aboveground biomass and carbon in an old-growth Central Hardwood forest. The analyse...
متن کاملDetermination of Spatial Distribution Pattern Analysis of Acer Velutinum Species in two Elevation Classes using Distance Sampling Methods (Case Study: Asalem Nav Forests, Series No. 2)
One of the important features of plant communities is the spatial pattern of trees. The spatial pattern of the stands determined by measuring and positioning of trees in the stands and inserting them in analytical frameworks. This is because spatial information allows natural resource managers to make and perform better-informed decisions, -. The aim of this study was to assess the spatial patt...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014